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Nonlinear  Filtering: 

Stochastic  Analysis  and  Numerical  Methods 

1  Statement  of  the  Problem 

The  project  was  concerned  with  the  analysis  of  partially  observed  stochastic  differential 
systems  and,  in  particular,  with  numerical  methods  in  nonlinear  filtering.  The  major 
objective  was  to  develop  numerical  methods  that  could  be  used  in  a  variety  of  state 
estimation  problems  and  in  associated  statistical  problems.  The  research  was  conducted 
by  a  team  of  investigators  with  expertise  in  nonlinear  filtering,  SPDE’s,  numerical  analysis, 
and  statistics. 

The  thrust  of  the  research  during  the  reporting  period  was  in  numerical  aspects  of  non¬ 
linear  filtering,  in  particular  in  development  of  numerical  approximation  schemes  suitable 
for  applications  in  real  time  target  tracking. 

Filtering  (estimation  of  a  “state  process”  from  noisy  observations)  is  a  classical  problem 
in  the  statistics  of  stochastic  processes.  It  is  of  central  importance  in  navigation,  image 
and  signal  processing,  control  theory,  automatic  tracking  systems,  and  other  areas  of 
engineering  and  science.  Filtering  is  one  of  the  exemplary  areas  where  the  application 
of  modern  stochastic  analysis  and  stochastic  numerics  lead  to  substantial  advances  in 
engineering. 

Target  tracking  and  identification  is  one  of  the  main  application  of  nonlinear  filtering 
and  will  be  emphasized  in  examples  below. 

The  desired  solution  of  the  filtering  problem  is  an  algorithm  that  provides  the  best 
mean  square  estimate  of  the  given  functional  of  the  state  process.  In  many  apphcations 
(most  notably,  battlefield  target  tracking)  real  time  implementation  of  such  algorithm  is 
an  important  requirement.  In  the  Gaussian  case,  the  optimal  (Kalman)  filter  [39]  meets 
this  requirement.  The  real  time  implementation  of  Kalman  filter  is  readily  available 
even  for  higher  dimensional  state  processes  due  to  the  fact  that  in  the  Gaussian  case  the 
posterior  distribution  admits  finite-dimensional  sufficient  statistics.  There  is  a  handful 
of  special  nonlinear  situation  when  existence  of  finite-dimensional  sufficient  statistics  for 
filtering  density  remains  to  be  the  case.  However,  in  general  this  is  not  true,  and  one  has 
to  deal  with  infinite-dimensional  posterior  distributions  (see  section  2.1). 

The  customary  way  to  address  this  problem  is  to  compute  the  nonlinear  filtering 
density  by  solving  the  Kushner  or  Zakai  equations  or  in  the  case  of  discrete  observations, 
the  Fokker-Planck  equation.  As  we  show  below,  it  is  impossible  to  obtain  corresponding 
solutions  in  real  or  almost  real  time  using  direct  methods.  Thus,  special  techniques  are 
needed. 
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2  Summary  of  Results  and  Accomplishments 

The  main  directions  of  the  research  and  accomphshments  of  the  project: 

•  Development  of  a  spectral  approach  to  optimal  nonlinear  filtering  and  related  ef¬ 
fective  numerical  approximations  to  the  optimal  nonlinear  filter  based  on  Wiener 
Chaos  Expansion; 

•  A  complete  solution  of  “the  last  Wiener  problem”  -  development  of  a  Wiener  type 
optimal  nonlinear  filter; 

•  Development  of  approximation  methods  for  nonlinear  filtering  based  on  the  projec¬ 
tion  filter  and  the  assumed  density  filter; 

•  Nonlinear  filtering  with  distributed  observations; 

•  Clutter  removal  in  nonlinear  filtering  for  imaging  data; 

•  Martingale  problems  for  stochastic  PDE’s; 

•  Spectral  approach  to  parameter  estimation  in  SPDE’s; 

•  Exponential  forgetting  of  the  initial  condition  for  nonlinear  (prediction)  filters  and 
their  derivatives  with  respect  to  some  parameter; 

•  Geometric  ergodicity  for  the  extended  process  whose  components  are  the  unobserved 
state  process,  the  observation  process,  the  prediction  filter,  and  its  derivative; 

•  Large  time  asymptotics  (consistency,  asymptotic  normality)  of  parameter  estimators 
in  partially  observed  systems. 

2.1  Numerical  Approximations  to  the  Optimal  Nonlinear  Filter 
Based  on  Wiener  Chaos  Expansions 

The  case  where  the  state  process,  Xt,  evolves  in  continuous  time,  while  the  observations, 
Yt,  are  made  in  discrete  moments  i  =  4,  A:  =  0, 1, . . .,  is  perhaps  the  most  interesting  case 
for  apphcations.  However,  since  in  the  case  of  continuous  observation  separation  of  para¬ 
meters  and  observations  is  a  more  delicate  problem,  we  present  the  results  for  this  more 
difficult  case.  The  approach  is  based  on  the  technique  of  Wiener  Chaos  Expansions  [62]. 

To  be  specific,  let  us  consider  the  problem  of  estimation  of  a  function  of  the  state 
process,  f{Xt),  at  moment  t  based  on  observations  Y*  =  {ys,s  <  t}  (i.e.,  filtering) 
assmning  that  the  signal  and  observation  processes  are  described  by  the  Ito  stochastic 
differential  equations 


dXt  =  b(Xf)dt  -j-  (T(Xt)dWt,  Ao  —  xq  , 
dYt  =  hiXt)dt  +  dVt 


(1) 

(2) 
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where  Wt  and  V*  are  independent  standard  Brownian  motions,  t  G  i?+  =  [0,  oo).  The  re¬ 
sults  may  be  extended  to  the  case  (important  for  some  applications  with  “passive  noises” 
like  clutter,  e.g.  IRST  systems,  see  section  2.3)  where  both  the  system  noise  and  mea¬ 
surement  noise  are  correlated  processes  and  are  correlated  between  each  other.  However, 
here  for  the  sake  of  simplicity  we  consider  simpler  (still  general  enough)  “uncorrelated” 
models  (1),  (2). 

It  is  well  known  that  the  optimal  mean  square  filtering  estimate  ft  =  E[f(Xt)\Y*]. 
Define  the  process 

At  =  exp  |j^  h{Xs)dYs  -  ^  ^  |h(Xs)pds| 

where  Xs  is  a  solution  of  the  Ito  equation 

dXt  =  b(Xt)dt  -j-  (T(Xt)dAVt,  Xq  =  xq  . 

Also,  define  the  new  probability  measxrre  P  by  dP  =  A^’^dP  and  by  E  denote  the  operator 
of  mathematical  expectation  with  respect  to  this  measure.  Note  that  the  measure  P  is  a 
Wiener  measure  and  the  process  At  may  be  interpreted  as  a  likelihood  ratio  process  for 
the  “signal-t-noise”  and  “noise  only”  models.  Then  usual  Bayesian  argument  yields 

/t  =  E[/(A:t)At|y‘]/E[At|y‘] 

where  <pt[g]  =  E3[^(Xt)At|T*]  is  the  so  called  rumormalized  optimal  filter. 

Under  very  general  assumptions,  this  filter  is  of  the  form 

E[/(Xt)Atiy*]  =  /  /  {x)pt{x)dx, 

where  the  conditional  probability  density  pt{x),  usually  referred  to  as  the  unnormalized 
filtering  density  (UFD),  can  be  characterized  as  the  imique  solution  of  the  following  SPDE 
(Zakai  equation): 

dpt{x)  =  Cpt{x)  dt  +  'f^  hkPt{x)  dY^  ,  pq{x)  =  7r(x)  (3) 

ife=i 

(see  [15],  [53],  [64],  [59]).  Here  we  used  the  notation 

d  q2  ^  .  d 

r*  —  Y'  _ 'l - YF— 

dxtdxj 

for  the  operator  adjoint  to  the  partial  differential  operator  associated  with  (1)  with  co- 
variance  matrix  a  =  (a’’^)  =  (l/2)crcr*  (d  and  r  are  the  dimensions  of  vectors  Xt  and  Yt, 
respectively),  n{x)  is  a  “pdf”  for  the  initial  condition.  We  also  assumed,  without  loss  of 
generality,  that  the  covariance  matrix  of  measurement  noise  is  a  unit  matrix. 
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Thiis,  the  optimal  nonlinear  filtering  problem  reduces  to  computing  nonlinear  filtering 
density  by  solving  the  Zakai  equation  (3)  or  the  Kushner  equation  (see  [28]-[30])  or  in  the 
case  of  discrete  observations,  the  Fokker-Planck  equation. 

Many  applications,  such  as  target  tracking,  require  to  solve  these  equations  on-line, 
which  is  the  most  formidable  obstacle  on  the  way  to  real  time  implementation  of  optimal 
nonhnear  filters.  This  problem  is  plagued  by  the  curse  of  dimensionality.  Specifically,  if 
the  spatial  dimension  d  >  3,  then  the  computational  complexity  of  solving  the  second 
order  parabohc  PDE’s  like  Fokker-Planck  equation  or  stochastic  PDF’s  as  Kushner’s  or 
Zakai’s  equations  becomes  too  high  for  direct  on-line  implementation. 

On  the  first  stage  of  our  research  supported  by  this  grant,  we  proposed  a  spectral 
approach  to  nonlinear  filtering,  which  allows  us  to  circumvent  the  aforementioned  com¬ 
putational  difficulties  by  separating  parameters  and  observations  and  shifting  these  time 
consuming  operations  off  line  (see  [41]-[45],  [49]).  In  the  resulting  algorithm,  the  on-line 
computations  can  be  organized  recursively  in  time  and  are  relatively  simple  even  when  the 
dimension  of  the  state  process  is  large.  Moreover,  certain  functionals  of  the  state  process 
can  be  estimated  without  computing  the  uimormalized  filtering  density.  For  the  contin¬ 
uous  time  case,  the  approach  is  based  on  the  Wiener  Chaos  Expansions  and  is  explained 
below. 

The  main  idea  of  the  Wiener  Chaos  approach  is  to  represent  the  tmnormalized  filtering 
density  Pt(x)  in  the  form 

Pt(^)  =  53 

a 

In  this  formula  r  is  an  arbitrary  point  of  the  interval  [0,t),  the  functions  ^"(r,t)  are 
normalized  Wick  polynomials  (products  of  Hermit  polynomials)  of  the  Wiener  integrals 
J^mk{s)dYs,  where  ,  k  =  1,2,---}  is  a  complete  orthonormal  system  (CONS)  in 
L2{0,t),  4>°‘{r,t,x)  are  deterministic  Fourier  coefficients  in  the  Cameron-Martin  orthogo¬ 
nal  decomposition  of  pt{x)  and  the  sununation  in  (4)  is  over  all  multi-indices  a. 

It  is  also  proven  in  [40],  [43],  [49],  [51],  [52]  (and  this  is  the  central  part  of  the  result) 
that  <f)f{x)  satisfies  a  recmrent  system  of  Kolmogorov-like  equations.  This  system  has 
an  especijdly  simple  form  if  the  observation  process  Yt  is  one-dimensional.  In  this  case, 
given  a  multi-index  a  =  {ai,  a2)  •  * ') 

^0“(r,s,a:)  =  /I*  0“(r,s,x) -1- 53Q!fc’T^fc(s)  ^''(a?)  ,s>r 

(5) 

4P{r,r,x)  =  Pr  (a:)  l{ia|=o} 
where  C*  is  the  same  operator  as  in  (3),  and 

a{k)  =  (ai,  Q!2,  •  •  • ,  otk-i,  -  1,  otk+i ,  •  •  •)  . 

Below,  system  (5)  is  referred  to  as  S'-system. 
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The  Wiener  Chaos  expansion  for  UFD  (4)  by  itself  does  not  provide  yet  the  desired 
effect  of  separation.  Indeed,  on  every  time  step,  a  solution  of  the  5— system  depends 
on  the  previous  observation  via  its  initial  condition,  and  so  could  not  be  solved  off  line. 
To  achieve  complete  separation,  we  consider  the  spatial  Fourier  coefficients  of  the  UDF, 
(i)  ~  fn/i  Pt  (^)  (®)  ^  ~  1,2,.... 

Let  0  =  to  <  <  •  •  •  <  =  T’  be  a  miiform  partition  of  the  interval  [0,  T]  with  step 

A  (so  that  ti  =  iA,  i  =  0, . . .  ,M).  Let  {m^}  =  be  a  CONS  in  Z/2([ti_i,ti]). 

It  was  shown  in  [43]  that  the  Fourier  coefficients  (t)  satisfy  the  following  recmsive 
equation: 

V’/(0)  =  /  p{x)ei{x)dx,  i)i{k)  =  '^Qin{k)iinik  -  1),  A:  =  1,2,...  (6) 

jR<i  n 

where 

'tpl{k)  =  Qln{k)  =  (tfc_l,tfc),  ^lan  —  f  ^ 

a 

and  where  S2,ei  is  a  solution  of  the  5-system  (5)  subject  to  the  initial  condition  0"(0, 0,  x)  = 

6/  (x)  l{|a|=0}- 

Note  that  the  kernel  F  is  the  only  term  in  (6)  that  requires  solving  PDF’s.  This  term  is 
defined  only  by  the  parameters  (the  coefficients)  of  the  state  and  observation  processes  (1), 
(2),  and  by  the  complete  orthonormal  system  {e/},  and  so  it  can  be  precomputed  off  line. 
On  the  contrary,  the  Wick  polynomials  (ti-i,ti)  are  defined  only  by  the  observation 
process  {Yt ,  U-i  <  t  <  U}  and  the  chosen  orthonormal  system  Hence  (6) 

possesses  the  desired  separation  property. 

The  algorithm  may  be  summarized  as  follows: 

1.  Before  the  observations  become  available 

a)  compute  ^/(O)  :=  (p,ej)  and  fi  =  {f,ei)  where  p  is  the  initial  density; 

b)  compute  S%^ei,  a  solution  of  the  S— system  (5)  subject  to  the  initial  condition 
(f)°‘{0,0,x)  =  ei  (x)  l{|a|=o}; 

c)  compute  Tian  =  /ijd  Cn  (a:)  S^ei  {x)  dx. 

2.  When  the  observation  become  available  (on  the  k-th  step),  A:  =  1,2, . . . 

a)  compute  (4-i)  4); 

b)  compute  i}i{k)  =  T,aT,n'4’n{k  -  l)rjQ„^“ (4-1,4); 

c)  compute  the  UFD  and  the  optimal  filter, 

PtK  =  IZV’/(fc)e/(aj)  (7) 

I 

ftk  =  (8) 
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We  refer  to  this  algorithm  as  the  spectral  separating  scheme  (S^).  Of  course,  to  make 

computable  one  has  to  truncate  all  the  involved  infinite  series.  The  convergence  of  the 
resulting  approximation  algorithm  and  a  thorough  error  analysis  has  been  done  in  [43], 
[51],  Here  we  only  mention  that  even  in  the  case  of  low  order  approximation  based  on 
Wick  polynomials  of  order  la]  =  2  using  only  one  element  of  the  basis  {rrik}  the  mean 
square  error  of  approximation  for  the  UFD  and  the  optimal  filter  is  of  the  order  of  A^. 

With  the  appropriate  choice  of  bases  the  on-line  computational  complexity  of  is 
linear,  i.e.  the  munber  of  operations  per  step  is  O  (Nd)  where  Nd  is  the  total  number  of 
spatial  points  (see  [42]). 

We  would  like  to  stress  the  following  features  of  the  algorithm: 

(1)  The  time  consuming  operations  of  solving  the  partial  differential  equation  (5)  and 

computing  integrals  are  performed  off  line; 

(2)  The  overall  amount  of  the  off-line  computations  does  not  depend  on  the  number  of 

the  on-line  time  steps; 

(3)  Formula  (8)  can  be  used  to  compute  an  approximation  to  ft  (e.g.  conditional  mo¬ 

ments)  without  the  time  consuming  computations  of  the  filtering  density  pt{x)  and 
the  related  integrals; 

(4)  Only  the  Fourier  coefficients  must  be  computed  at  every  time  step,  while  the 
approximate  filter  ft  and/or  the  filtering  density  pt{x)  can  be  computed  as  needed, 
e.g.  at  the  final  time  moment. 

Based  on  the  principle  of  separation  of  observations  and  parameters,  we  developed 
a  family  of  fast  stochastic  numerical  algorithms  for  nonlinear  filtering  (see  [42]-[44]). 
These  numerical  schemes  were  implemented  and  successfully  tested  in  several  problems 
of  target  tracking,  including  tracking  of  acutely  maneuvering  targets  with  angle-only 
nieasurements,  fusion  of  kinematic  (radar)  tracking  and  imaging,  etc. 

2.2  Nonlinear  Filtering  with  Distributed  Observations 

Filtering  of  a  signal  with  distributed  observation  is  one  of  the  most  important  and  at 
the  same  time  most  challenging  problems  of  signal  and  image  processing.  A  distinctive 
feature  of  this  particular  problem  is  that  the  observation  is  a  sequence  of  random  fields 
rather  then  a  random  process. 

It  was  explained  above  that  the  main  difficulty  in  practical  implementation  of  nonlinear 
filters  and  predictors  is  their  computational  complexity.  The  main  bulk  of  computations 
involved  in  nonlinear  filtering  comes  from  solving  associated  PDF’s  (Zakai  or  Kushner 
equations  if  the  observation  process  is  continuous  in  time  and  Fpkker-Planck  equation  if 
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it  is  discrete).  In  the  case  of  distributed  observation  the  problem  becomes  even  harder 
due  to  high  dimensional  measurements. 

The  spectral  approach  to  nonlinear  filtering  developed  by  the  PI  and  his  collaborators 
in  the  setting  with  low  dimensional  observation  can  be  extended  to  the  case  of  distributed 
measurements.  Moreover  we  argue  that  in  addition  to  being  temporally  recursive,  the 
resulting  algorithm  is  also  spatially  recursive.  More  specifically,  it  can  be  shown  that  if 
at  some  point  additional  measurements  become  available,  then  they  can  be  incorporated 
into  the  optimal  filter  without  recomputing  the  latter  from  scratch.  This  property  is  very 
important  since  it  allows  to  perform  sequential  multi-resolution  filtering. 

Example.  An  important  practical  motivation  for  the  above  setting  is  a  problem  of  deter¬ 
mining  the  position  of  a  dim  target  moving  in  a  plane  or  in  3D  space,  using  a  sequence  of  noisy 
images  of  the  region  of  the  space  in  which  it  evolves  (see  Figure  1). 

If  the  signal-to-noise  ratio  (SNR)  is  large  enough,  the  target  could  be  localized  on  a  single 
image  using  the  well  developed  theory  of  matched  filters  (see  Reed  et  al  [58]).  The  problem  of 
detection  becomes  much  more  difficult  if  localization  of  the  target  on  a  single  image  is  impossible 
or  at  least  hard  and  fraught  with  ambiguity  (for  example,  the  target  may  not  be  visible  at  all  on 
any  single  frame,  see  Figure  2).  In  the  latter  case  one  has  to  align  successive  frames  according 
to  typical  patterns  of  target  dynamics.  If  the  alignment  is  done  properly  the  signals  of  the 
various  images  would  add  up  and  produce  a  “spike”  with  a  sufficiently  large  SNR  while  the 
noises  would  cancel  out.  This  approach  to  detection  of  a  dim  target  is  usually  referred  to  as 
“tracking  before  detection”  (TBD).  Unfortunately  the  alignment  of  successive  frames  necessary 
for  TBD  is  extremely  difficult  in  the  case  of  an  acutely  maneuvering  noncooperative  target. 

To  counter  the  aforementioned  difficulty,  an  algorithm  for  frame  alignment  based  on  optimal 
nonlinear  filtering  and  prediction  was  proposed  in  [26].  The  results  of  implementation  of  the 
spatial-temporal  nonlinear  filtering  algorithms  with  the  use  of  Haar  basis  are  shown  in  Figure  3. 
The  target  is  completely  localized  after  30  frames  of  processing. 


2.3  Clutter  Removal  In  Nonlinear  Filtering  For  Imaging  Data 

The  majority  of  filtering  methods  used  for  target  tracking  and  detection  strongly  rely 
on  a  “signal-plus-noise”  (SPN)  assmnption  for  a  mathematical  model  of  initial  data. 
However,  the  original  sensor  data  typically  does  not  fit  such  an  assumption  due  to  the 
presence  of  clutter  components.  Thus,  an  important  issue  is  to  efiminate  the  clutter  and  to 
reduce  the  original  measurements  to  a  SPN-model.  To  accomphsh  this  goal,  we  consider 
nonparametric  statistical  methods. 

Nonparametric  regression  algorithms  can  be  regarded  as  methods  of  clutter  estimation, 
or  function  smoothing,  such  that  the  residuals  between  the  original  data  and  its  smoothed 
version,  or  estimate,  would  be  reasonably  approximated  by  SPN-models.  Kernel  methods 
provide  a  powerful  tool  for  such  analysis  due  to  both  computational  transparency  and 
asymptotic  optimality  in  various  settings  of  interest  for  observations  in  R‘^  with  arbitrary 
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d  >  1,  see  [17].  For  example,  if  2-D  observations  are  of  the  form 


Zij  =  f(xi,yj) +^ij,  Xi  =  i/ni,  j  =  j/n2,  i  =  1,  ■  ■  ■  ,ni,j  -  I, . . .  ,n2, 


where  f{xi,yj)  is  a  value  of  an  unknown  function  /  at  a  point  {xi,yj),  are  random 
variables  with  zero  means,  then  a  kernel  estimator  is  given  by 


f{xi,yj)  = 


i  -  h  j  -  k 

Ni  ’  N2 


), 


where  N\,N2  are  window  sizes  in  corresponding  directions,  K{x,y)  is  a  deterministic 
function,  or  kernel,  K  :  R?  ^  R},  such  that  J  K{x,y)  dxdy  =  1.  It  is  readily  seen  that 
kernel  estimators  are  weight^  moving  averages  of  observations. 

The  application  of  various  kernels  to  real  data  sets  is  discussed  in  [37]  along  with  the 
verification  of  “white  noise”  assumption  for  the  model  of  residuals.  The  latter  analysis 
relies  on  nonparametric  rank  methods  [25].  The  results  of  data  smoothing  and  clutter 
removal  are  shown  in  Figure  4  a,b. 


2.4  Martingale  Problems  for  Stochastic  PDE’s 

The  results  related  to  constructing  the  optimal  nonlinear  filters,  reported  above,  would 
not  have  been  possible  without  substantial  development  of  the  theory  of  stochastic  PDE’s. 
During  the  reporting  period  we  made  a  substantial  progress  in  this  crucial  area. 

Specifically,  we  studied  nonlinear  stochastic  PDE’s  with  non-smooth  (in  some  cases 
singular)  coefficients.  The  examples  include  stochastic  Navier-Stokes  equation,  Langevin 
(stochastic  quantization)  equation  in  P{ip)2  Euclidean  quantum  field  theory,  SPDE’s  for 
the  super-Brownian  motion  and  some  related  superprocesses.  We  concentrated  on  ex¬ 
istence,  imiqueness,  absolute  continuity  and  singularity  of  distributions,  and  ergodicity 
problems  for  these  equations. 

Various  classes  of  stochastic  differential  equations,  where  strong  solutions  do  not  exist 
or  where  their  existence  is  very  difficult  to  prove,  can  be  handled  using  the  martingale 
approach.  For  example,  the  martingale  approach  is  very  useful  in  situations  with  non¬ 
smooth  coefficients  typical  of  many  SPDE’s  arising  in  physics  and  other  sciences.  These 
include  the  majority  of  equations  obtained  as  limits  of  branching  particle  systems,  equa¬ 
tions  of  stochastic  hydrodynamics,  stochastic  quantization  equations  in  quantmn  field 
theory. 

Our  paper  [48]  deals  with  martingale  problems  in  topological  vector  spaces  and  their 
applications  for  stochastic  PDE’s.  In  particular,  we  derive  the  necessary  and  sufficient 
criteria  for  absolute  continuity  of  solutions  to  martingale  problems.  One  interesting  ap¬ 
plication  of  this  result  is  the  characterization  of  all  the  measures  absolutely  continuous 
with  respect  to  super-Brownian  motion.  We  also  apply  the  obtained  criteria  to  estab- 
hsh  uniqueness  and  existence  of  certain  classes  of  SPDE’s  including  quasi-hnear  SPDE’s 
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Obs:1 40x1 40x66,  Target:3x3,  SNR  -5.2dB 
Basis=Haar,  1 28x1 28,  CPU=0.66s/frame 
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Figure  3 :  Results  of  ofF-line/on-line  non-linear  filtering,  frame  30. 
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with  measurable  (but  not  necessarily  continuous)  “drift” .  In  the  same  paper  we  extended 
Viot’s  compactness  method  and  apphed  it  to  solving  nonhnear  SPDE’s.  Specifically,  we 
investigated  existence  of  weak  solutions  to  stochastic  Navier-Stokes  type  equations  and 
some  other  quasi-finear  SPDE’s  with  polynomially  growing  coefl&cients. 

In  [48]  we  also  addressed  the  P((p)2  stochastic  quantization  equation.  The  equation 
of  stochastic  quantization  was  originally  introduced  and  studied  by  physicists  (see  Parisi 
and  Wu  [56],  Jona^Lasinsio  and  Mitter  [24])  with  the  intent  to  bring  dynamics  in  P((p)2 
Euclidean  quantum  field  theory.  The  idea  was  to  demonstrate  that  this  equation  plays 
the  role  of  Langevin’s  equation,  in  that  its  solution  is  an  ergodic  Markov  process  whose 
xmique  invariant  measure  is  the  Euclidean  P(ip)2  measure.  Due  to  the  singular  nature 
of  the  nonhnear  term,  the  stochastic  quantization  equation  is  quite  challenging.  In  [48] 
we  proved  the  long  standing  conjecture  that  in  the  non-regular  case  the  distributions  of 
stationary  solutions  to  this  equation  and  the  equation  for  the  free  field  are  singular. 

2.5  Spectral  Approach  to  Parameter  Estimation  in  SPDE’s 

Another  important  issue  was  the  parameter  estimation  for  stochastic  evolution  equa¬ 
tion.  In  [18]  and  [46]  we  studied  parameter  estimation  for  randomly  perturbed  PDE’s. 
Specifically,  we  investigated  asjunptotic  properties  of  estimators  based  on  finite  number 
of  observable  Fourier  coefiicients  of  the  random  field  described  by  the  PDE  in  question. 
Necessary  and  sufficient  conditions  were  found  for  the  consistency,  asymptotic  normality 
and  efiiciency  of  the  estimates  when  the  number  of  spatial  modes  increases. 

These  results  were  applied  to  some  parametric  models  of  passive  scalars  transport 
by  turbulent  flow.  In  particular,  it  was  shown  that  the  diffusivity  exponent  in  the  sto¬ 
chastic  (advection-diffusion)  equation  for  heat  transport  is  always  consistent,  while  the 
consistency  of  the  feedback  parameter  estimate  depends  on  the  spatial  dimension.  Unlike 
previous  works  on  the  subject,  no  commutativity  is  assumed  between  the  operators  in  the 
equation. 

2.6  Projection  Filter 

Here  we  again  consider  a  filtering  problem  where  the  state  evolves  according  to  a  stochas¬ 
tic  differential  equation  (SDE),  and  the  objective  is  to  estimate  the  state  from  nonlinear 
observations  in  additive  Gaussian  white  noise  (see  (1),  (2)).  As  we  stated  above,  the 
filtering  problem  consists  in  the  calculation  of  the  whole  conditional  probabihty  distrib¬ 
ution  of  the  state  given  past  observations,  which  results  in  an  infinite  dimensional  filter. 
Under  some  regularity  assumptions,  the  conditional  probability  distribution  is  absolutely 
continuous  with  respect  to  the  Lebesgue  measure,  and  the  conditional  density  pt  is  the 
rmique  solution  of  the  Kushner-Stratonovich  equation,  a  stochastic  PDE. 

The  projection  filter  is  a  finite  dimensional  nonhnear  filter  based  on  the  differential 
geometric  approach  to  statistics.  In  Brigo,  Hanzon  and  LeGland  [6,  9]  the  projection  filter 
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was  particularized  to  exponential  families  in  the  framework  of  SDE’s  on  manifolds.  The 
projection  filter  is  defined  by  orthogonally  projecting  the  right-hand  side  of  the  Kushner- 
Stratonovich  equation  for  onto  the  tangent  space  of  a  finite  dimensional  manifold  of 
(square-root  of)  probability  densities,  according  to  the  Fisher  metric  and  its  extension  to 
infinite  dimensional  space  of  square  roots  of  densities,  known  as  the  Hellinger  distance. 
In  practice,  we  use  the  manifold  EM^/^(c)  associated  with  an  exponential  family  EM(c) 
defined  by  the  coefiicients  c  =  {ci,  •  •  • ,  Cm}-  Although  at  first  sight  the  resulting  equation 
may  look  hke  a  stochastic  PDE,  it  is  just  a  finite  dimensional  SDE  for  the  parameter  6t 
of  the  projection  filter  density  p(-,  ^t). 

Another  approximation  method  in  nonlinear  filtering  is  the  assumed  density  filter 
(ADF)  obtained  by  closing  the  equations  for  a  few  conditional  moments,  under  the  as¬ 
sumption  that  the  exact  conditional  density  is  of  a  given  form,  see  Kushner  [31].  In  the 
case  of  the  exponential  family  EM(c),  one  would 

1.  write  the  equation  for  the  evolution  of  the  conditional  c-moment  T]t  =  E[c(A()  |  Tt], 
and 

2.  evaluate  all  the  conditional  expectations  appearing  in  the  r.h.s.  of  the  equation  as  if 
the  conditional  density  was  the  density  in  the  exponential  family  imiquely  defined 
by  the  expectation  parameter  rjt. 

The  Ito-based  ADF  and  the  Stratonovich-based  ADF  are  different  filters  in  general, 
which  is  due  to  the  logical  inconsistency  that  is  inherent  to  the  ADF-concept:  selecting  a 
different  set  of  equations  to  which  it  is  applied,  leads  to  different  results. 

Main  results.  We  have  shown  in  Brigo,  Hanzon  and  LeGland  [9]  that  it  is  possible  to 
define  simplifying  exponential  families  EM(c*)  and  EM(c*)  such  that  the  corresponding 
exponential  projection  filter  has  the  following  important  properties: 

•  with  the  choice  EM(c*),  the  diffusion  coefficient  in  the  stochastic  differential  equa¬ 
tion  for  the  PF  parameter  9t  is  constant,  i.e.  the  equation  has  a  very  simple  form, 
and  it  is  possible  to  define  an  a  posteriori  estimate  of  the  local  error  resulting  from 
the  projection  filter  approximation, 

•  with  the  choice  EM(c*),  the  correction  step  in  the  nonlinear  filtering  algorithm  with 
discrete-time  observations  is  handled  exactly,  without  any  error. 

We  have  shown  in  Brigo,  Hanzon  and  LeGland  [8]  that  : 

•  for  any  exponential  family  EM(c),  the  projection  filter  coincides  with  the  Stratonovich- 
based  assumed  density  filter, 

•  with  the  choice  EM(c*),  the  Ito-based  ADF  coincides  with  the  Stratonovich-based 
ADF. 
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2.7  Nonobservable  Systems  Observed  in  Small  Noise 

We  have  also  studied  the  small  noise  asymptotics  of  the  Bayesian  estimator,  based  on 
continuous  time  nonlinear  regression  observation  in  additive  Gaussian  white  noise 

dXt  =  mt(0)dt  +  sdWf  . 

Using  the  Bayesian  approach,  we  model  the  a  priori  information  on  the- unknown  para¬ 
meter  ^  by  a  prior  probability  distribution,  and  the  posterior  probability  distribution 
is  defined  by  the  Bayes  formula.  Under  the  usual  identifiability  assumption  that  the  true 
value  a  of  the  parameter  is  the  only  minimxun  point  of  the  Kullback-Leibler  information 

Ka{e)  =  I  r  \mt{d)  -  dt  , 

J  0 

the  Bayesian  estimator  is  consistent,  i.e.  ^  6^  in  P®-probability  as  e  J.  0,  see  Kutoy- 
ants  [32].  However,  there  are  some  practical  situations  where  nonidentifiablilty  occurs, 
i.e.  where  the  set  Ma  of  minimum  Kullback-Leibler  information  does  not  reduce  to  {a}. 
In  this  case,  the  Bayesian  point  estimator  is  not  relevant,  and  we  are  rather  interested  in 
the  asymptotic  behavior  of  the  posterior  probability  distribution  as  e  J,  0. 

A  prototype  of  this  situation  is  the  following  problem.  Consider  a  nonlinear  filtering 
problem  with  noise-free  dynamics,  where  the  unobserved  process  evolves  according  to 
an  ODE  with  unknown  initial  condition,  and  the  observations  are  corrupted  by  a  small 
additive  Gaussian  white  noise.  If  the  limiting  deterministic  system  is  nonobservable,  then 
the  corresponding  statistical  problem  is  nonidentifiable.  Let  us  mention  the  TMA  as  a 
typical  application  where  nonobservability  occurs,  see  Levine  and  Marino  [38]. 

Main  results.  We  have  studied  the  case  where  the  set  Ma  of  points  with  minimum 
Kullback-Leibler  information  is  a  submanifold.  It  is  easy  to  show  that  asymptotically  as 
e  I  0  the  probability  distribution  p^  is  supported  by  M„.  We  have  shown  that,  using  first 
order  terms,  such  as  the  Fisher  information  matrix,  it  is  possible  to  characterize  the  limit 
as  £•  i  0  of  the  probability  distribution  p^,  as  a  random  probabihty  distribution  pa  on 
Ma,  absolutely  continuous  w.r.t.  the  canonical  measure  on  Ma,  and  to  provide  an  explicit 
expression  for  the  density. 

To  study  the  rate  of  convergence,  we  have  considered  the  posterior  probability  dis¬ 
tribution  of  the  normalized  deviation  [9  —  7r(0)]/e,  where  tt  denotes  the  orthogonal 
projection  on  the  set  Ma-  We  have  characterized  the  limit  as  £  i  0  of  the  probability  dis¬ 
tribution  as  a  mixture  Ua  of  random  Gaussian  probability  distributions  on  the  normal 
bundle  space  to  Ma- 

2.8  Exponential  Forgetting  and  Geometric  Ergodicity 

Consider  the  situation  where  the  state  sequence  {X„}  is  a  time-homogeneous  Markov 
chain  with  finite  state  space  S,  initial  probability  distribution  p,  =  (pi),  and  transition 
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probability  matrix  Q,  =  {q^).  It  is  assumed  that  only  observations  {F„}  are  available, 
which  are  mutually  independent  given  the  state  sequence,  with  conditional  densities  b,  = 
(6;).  In  other  words,  we  consider  a  typical  hidden  Markov  model  (HMM). 

The  prediction  filter,  which  is  the  probability  distribution  of  the  state  given  past 
observations  lo,  •  •  •  ,Yn-i,  solves  an  equation  with  values  in  the  set  P(S')  of  probability 
distributions  over  the  finite  set  S.  However  in  practice  the  initial  probability  distribution 
p„  the  transition  probabihty  matrix  Q„  and  the  vector  6.  of  observation  conditional  den¬ 
sities  -  which  define  the  model  P,  -  are  generally  unknown,  and  we  consider  instead  the 
equation  for  the  prediction  filter  {pn}  associated  with  a  wrong  initial  probability  distri¬ 
bution  po,  a  wrong  transition  probability  matrix  Q,  and  a  wrong  vector  b  of  observation 
conditional  densities.  Note  that  these  misspecification  issues  are  of  a  different  nature: 

•  we  expect  that  a  wrong  initial  condition  for  the  prediction  filter  is  rapidly  forgotten, 
so  that  we  could  use  any  initial  condition  with  practically  the  same  effect, 

•  on  the  other  hand,  we  expect  that  two  different  transition  probability  matrices, 
and  two  different  vectors  of  observation  conditional  densities  will  produce  two  sig¬ 
nificantly  different  observation  sequences,  so  that  we  could  estimate  the  unknown 
transition  probabihty  matrix  and  the  rmknown  vector  of  observation  conditional 
densities  accurately,  by  accmnulating  observations. 

Towards  identification  of  HMM’s,  we  consider  also  the  gradient  {wn}  of  the  prediction 
filter  {pn}  w.r.t.  some  parameter,  which  solves  a  linear  equation  with  values  in  the  set 
S  =  {tw  :  e*w  =  0},  which  is  the  hnear  tangent  space  to  (S).  Here  e  =  (1,  •  •  • ,  1)* 
denotes  the  vector  with  all  entries  equal  to  1. 

Main  results.  Under  the  assumption  that  the  transition  probabihty  matrix  Q  is 
primitive,  we  have  obtained  an  explicit  upper  boimd  for  the  difference  between  the  so¬ 
lutions  of  the  misspecified  prediction  filter  equation  starting  from  two  different  initial 
conditions.  We  have  also  obtained  an  exphcit  bound  for  the  Lipschitz  constant  of  the 
solution  map  associated  with  the  misspecified  prediction  filter  equation. 

These  two  non-logarithmic  and  non-asymptotic  boimds  go  to  zero  at  exponential  rate 
as  time  goes  to  infinity,  and  as  a  consequence,  under  the  additional  assumption  that 
the  true  transition  probabihty  matrix  Q,  is  primitive  as  weU,  we  have  obtained  an  up¬ 
per  boimd  for  the  P,-a.s.  exponential  rate  of  forgetting  of  the  initial  condition  for  the 
misspecified  prediction  filter  equation. 

We  have  obtained  similar  results  about  the  forgetting  of  the  initial  condition  for  the 
misspecified  hnear  tangent  prediction  filter. 

Using  the  estimates  on  exponential  forgetting,  we  have  proved  the  geometric  ergodicity 
of  the  extended  Markov  chain  {Z^  =  {Xn,Yn,Pn,yJn)} ,  under  some  mild  integrability 
assumption  on  the  vectors  6,  and  b  of  observation  conditional  densities. 
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As  a  consequence,  we  have  also  proved  the  uniqueness  of  an  invariant  measure,  and 
the  existence  of  a  solution  to  the  associated  Poisson  equation.  Prom  this  result,  the  law 
of  large  munbers,  and  the  central  limit  theorem  can  be  proved  for  the  extended  Markov 
chain  (Zn  ~ 


2.9  Asymptotic  Properties  of  the  MLE  and  the  CLSE 

We  consider  the  problem  of  HMM  identification  (i.e.  partially  observed  finite-state  Markov 
chain),  based  on  noisy  observations.  We  suppose  that  the  law  of  the  hidden  Markov  chain 
depends  on  some  imknown  finite-dimensional  parameter  6  E  0,  and  we  are  interested  in 
the  estimation  of  9. 

This  is  the  continuation  of  the  problem  presented  in  the  previous  section.  Along 
with  the  true  but  unknown  model  P„  with  initial  probability  distribution  p„  transition 
probability  matrix  Q,,  and  vector  6,  of  observation  conditional  densities,  we  consider  a 
parametric  model  {P® ,  0  G  0},  with  initial  probabifity  distribution  Po  p,,  transition 
probability  matrix  Qe,  and  vector  bg  of  observation  conditional  densities. 

We  assume  that  there  exists  a  true  value  a  G  0  of  the  parameter,  such  that  Q,  =  Qa 
and  b,  =  ba-  Since  the  initial  probabifity  distributions  p,  and  po  are  possibly  different, 
the  true  probabifity  P,  does  not  belong  in  general  to  the  family  {P®  ,6  G  0},  i.e.  the 
statistical  model  is  misspecified.  However,  it  follows  from  the  results  presented  above  that 
a  wrong  initial  probabifity  distribution  is  rapidly  forgotten,  so  that  from  an  asymptotic 
point  of  view,  P,  and  P“  are  practically  equivalent.  We  make  the  assmnption  that  for 
the  true  value  a  G  0,  the  transition  probabifity  matrix  Qa  is  primitive. 

In  this  parametric  model,  it  is  easy  to  show  that  many  functions  of  interest  for  the 
estimation  of  the  imknown  parameter  9,  e.g.  the  log-likelihood  function  or  the  conditional 
least-squares  functional,  and  the  gradient  of  these  functions  w.r.t.  the  parameter  9,  can  be 
expressed  as  additive  functionals  of  the  extended  Markov  chain  {Z^  =  (Xn,  Yn,pi,  dp^)}, 
where  {p®}  is  the  prediction  filter  corresponding  to  the  value  9  of  the  parameter,  and 
{5p^}  is  the  corresponding  linear  tangent  prediction  filter. 

Main  results.  Using  the  explicit  boimd  for  the  Lipschitz  constant  of  the  solution 
map  associated  with  the  prediction  filter  equation,  we  have  proved  that  the  log-likelihood 
function  and  the  conditional  least-squares  functional  are  Lipschitz  continuous  w.r.t.  the 
parameter,  uniformly  in  time. 

Using  the  existence  of  a  solution  to  the  Poisson  equation  associated  with  the  extended 
Markov  chain  and  the  resulting  law  of  large  numbers  and  central  limit  theorem,  we  have 
been  able 

•  to  obtain  an  explicit  expression  for  the  P,-a.s.  limit  of  the  log-likelihood  function 
(suitably  normalized),  i.e.  for  the  Kullbadc-Leibler  information,  and  to  prove  that 
it  is  miniTnum  in  the  true  value  a  of  the  parameter; 
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•  to  prove  the  asymptotic  normality  of  the  score  function  (suitably  normalized)  and 
to  obtain  an  explicit  expression  for  the  asymptotic  covariance  matrix,  i.e.  for  the 
Fisher  information  matrix. 

We  have  obtained  similar  results  for  the  conditional  least-squares  functional. 

These  results  allow  us  to  prove 

•  the  convergence  of  the  maximum  likelihood  estimator  and  the  conditional  least- 
squares  estimator  to  the  set  of  maxima  of  the  associated  contrast  function,  using 
the  Lipschitz  continuity  w.r.t.  the  parameter  of  the  log-Ukelihood  function  and  the 
conditional  least-squares  functional,  respectively; 

•  the  asymptotic  normality  of  the  maximum  likelihood  estimator  and  the  conditional 
least-squares  estimator. 

We  have  obtained  similar  convergence  results  for  recursive  versions  of  these  estimators, 
using  the  approach  of  Delyon  [13]  and  Delyon  and  louditski  [14]. 


3  Diffusion  of  Results 

The  results  of  our  research  on  nonhnear  filtering  and  SPDE’s  were  presented  by  B. 
Rozovskii  at  several  major  conferences;  these  include;  the  1996  SIAM  Annual  meeting 
(Kansas  City),  Fourth  World  Congress  of  the  Bernoulli  Society  (Vienna,  Austria),  the 
35*^  IEEE  Conference  on  Decision  and  Control  (Kobe,  Japan),  a  mini-course  on  Nonlin¬ 
ear  Filtering  (Special  Year  in  Stochastic  Analysis,  Mathematical  Science  Research  Insti¬ 
tute,  Berkeley,  1997),  invited  talk  at  the  International  Symposium  on  Stochastic  Control 
and  Nonlinear  Filtering  (Los  Angeles,  1997). 

Other  results  have  been  presented  by  Marc  Joannides  at  the  HCM  Workshop  on  Statis¬ 
tical  Inference  for  Stochastic  Processes,  Sonderborg,  Danemark,  April  29-March  3,  1996, 
and  at  the  Jomnees  SMAI  Modelisation  Aleatoire  et  Statistique  (MAS),  Toulouse,  Prance, 
September  23-25,  1996;  by  Prangois  LeGland  at  the  2nd  Portuguese  Conference  on  Au¬ 
tomatic  Control,  Porto,  Portugal,  September  11-13,  1996,  at  the  Workshop  on  Statistical 
Asymptotics  for  Continuous-Time  Stochastic  Processes,  LeMans,  Prance,  January  27-28, 
1997,  at  the  Workshop  on  Stochastic  Control  and  Nonlinear  Filtering,  Raleigh,  NC,  Oc¬ 
tober  11-12,  1996,  at  the  Seminaire  Signal-Image,  IRISA,  Reimes,  Prance,  November  28, 
1996,  at  the  International  Symposium  on  Mathematical  Theory  of  Networks  and  Systems 
(MTNS),  Saint  Louis,  MO,  Jtme  24-28,  1996,  at  the  36th  IEEE  Conference  on  Decision 
and  Control  (CDC),  San  Diego,  December  10-12,  1997  and  at  the  symposium  on  Sto¬ 
chastic  Control  and  Nonlinear  Filtering,  Los  Angeles,  December  13-15,  1997;  by  Laurent 
Mevel  at  the  workshop  on  Hidden  Markov  Models,  Evry,  Prance,  April  24,  1997,  and  at 
the  4th  European  Control  Conference  (ECC),  Brussels,  July  1-4,  1997. 
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7  Technology  Transfer 

The  developed  nonlinear  filtering  technology  (particularly,  track-before-detect  algorithm 
based  on  this  technology)  is  in  the  process  of  transfer  to  the  small  business  company  as 
well  as  to  the  TRW  Data  Technologies  Division  (Systems  Integration  Group)  imder  the 
1998  SBIR-STTR  program.  It  is  expected  that  the  developed  ideas  will  be  used  in  both 
radar  and  electro-optical  warning  systems  in  BMDO  programs. 
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Aegir  Systems  (Oxnard,  CA)  is  applying  our  numerical  algorithm  for  the  devel¬ 
opment  of  a  tracking  filter.  Testing  of  Aegir’s  tracker  indicates  that  the  advantage  of 
the  proposed  approach  is  especially  significant  for  angle-only  tracking  of  resolved  or  sub¬ 
resolved  targets  (IR,  EO  search  and  track  systems)  when  signal-to-noise  ratio  is  extremely 
low  (-3dB  to  -6dB). 


References 

[1]  A.  Arapostathis  and  S.I.  Marcus,  “Analysis  of  an  identification  algorithm  arising  in 
the  adaptive  estimation  of  Markov  chains,”  Mathematics  of  Control,  Signals,  and 
Systems,  vol.  3,  no.  1,  pp.  1-29,  1990. 

[2]  R.  Atar  and  O.  Zeitouni,  “Lyapunov  exponents  for  finite  state  nonlinear  filtering,” 
SIAM  Journal  on  Control  and  Optimization,  vol.  35,  no.  1,  pp.  36-55,  January  1997. 

[3]  J.S.  Baras,  “Real-time  architectures  for  the  Zakai  equations  and  applications, ’Tn 
Stochastic  Analysis,  (E.  Mayer-Wolf,  et  al.  Editors),  Academic  Press,  Boston,  pp. 
15-38,  1991. 

[4]  A.  Bensoussan,  R.  Glowinski,  and  R.  Rascanu,  “Approximation  of  the  Zakai  equation 
by  sphtting  up  method,” /S'/AM  Journal  on  Control  and  Optimization,  vol.  28,  pp. 
1420-1431,  November  1990. 

[5]  D.  Brigo,  Filtering  by  Projection  on  the  Manifold  of  Exponential  Densities.  Ph.D. 
Thesis,  Vrije  Universiteit,  Amsterdam,  November  1996. 

[6]  D.  Brigo,  B.  Hanzon,  and  F.  Le  Gland,  “A  differential  geometric  approach  to  nonhn- 
ear  filtering:  the  projection  filter,”  In  Proceedings  of  the  Sfth  Conference  on  Decision 
and  Control,  New  Orleans  1995,  pp.  4006-4011.  lEEE-CSS,  December  1995. 

[7]  D.  Brigo,  B.  Hanzon,  and  F.  Le  Gland,  “The  exponential  projection  filter  and  the 
selection  of  the  exponential  family,”  In  Proceedings  of  the  2nd  Portuguese  Conference 
on  Automatic  Control,  Porto  1996,  pp.  251-256.  APCA,  September  1996. 

[8]  D.  Brigo,  B.  Hanzon,  and  F.  Le  Gland,  “On  the  relationship  between  assmned  density 
filters  and  projection  filters,”  Discussion  Paper  TI  7-96-18,  Tinbergen  Institute, 
February  1996. 

[9]  D.  Brigo,  B.  Hanzon,  and  F.  Le  Gland,  “A  differential  geometric  approach  to  non¬ 
linear  filtering:  the  projection  filter,”  IEEE  Transactions  on  Automatic  Control,  vol. 
AC-43,  no.  2,  pp.  247-252,  February  1998. 

[10]  D.  Brigo,  B.  Hanzon,  and  F.  Le  Gland,  “Approximated  filtering  by  projection  on  the 
manifold  of  exponential  densities,”  Bernoulli,  4,  1998  (to  appear). 


ARO  Grant  DAAH04-95-1-0164  (CAMS,  USC) 


Nonlinear  Filtering  27 


[11]  Z.  Cai,  F.  Le  Gland,  and  H.  Zhang,  “An  adaptive  local  grid  refinement  method  for 
nonlinear  filtering,”  Publication  Interne  954,  IRISA,  October  1995. 

ftp : //ftp . irisa . f r/techreports/ 1995/PI-954 . ps . gz. 

[12]  F.  Campillo,  F.  Cerou,  F.  Le  Gland,  and  R.  Rakotozafy,  “Particle  and  cell  approxi¬ 
mations  for  nonlinear  filtering,”  Rapport  de  Recherche  2567,  INRIA,  June  1995. 
ftp : //ftp . inria . f r/INRIA/publicat ion/RR/RR-2567 . ps . gz. 

[13]  B.  Delyon,  “General  results  on  the  convergence  of  stochastic  algorithms,”  IEEE 
Transactions  on  Automatic  Control,  vol.  AC-41,  no.  9,  pp.  1245-1255,  September 

1996. 

[14]  B.  Delyon  and  A.  louditski,  “Stochastic  approximation  with  averaging,”  Publication 
Interne  952,  IRISA,  October  1995. 

ftp : //ftp . irisa.fr/techreports/1995/PI-952 . ps . gz. 

[15]  T.E.  Dimcan,  “Probabihty  densities  for  diffusion  processes  with  applications  to  non- 
hnear  filtering  theory,”  PhD  thesis,  Stanford  University,  1967. 

[16]  H.  Furstenberg  and  H.  Kesten,  “Product  of  random  matrices,”  The  Annals  of  Math¬ 
ematical  Statistics,  vol.  31,  no.  2,  pp.  457-469,  Jime  1960. 

[17]  W.  Hardle,  Applied  Nonparametric  Regression,  Cambridge  Univ.  Press,  1990,  Cam¬ 
bridge. 

[18]  M.  Hiiebner,  S.V.  Lototsky,  and  B.L.  Rozovskii,  “Asymptotic  properties  of  an  ap¬ 
proximate  mavirmiTn  likelihood  estimator  for  stochastic  PDF’s,”  In:  The  Liptser 
Festschrift,  Statistics  and  Control  of  Stochastic  Processes  (Ed.,  Yu.  Kabanov,  B. 
Rozovskii,  A.  Shiryaev),  World  Scientific  Publ.  Co.,  1997. 

[19]  M.  Joannides  and  F.  Le  Gland,  “Nonhnear  filtering  with  perfect  discrete  time  ob¬ 
servations,”  In  Proceedings  of  the  34th  Conference  on  Decision  and  Control,  New 
Orleans  1995,  pp.  4012-4017.  lEEE-CSS,  December  1995. 

[20]  M.  Joannides  and  F.  Le  Gland,  “Small  noise  asymptotics  of  the  Bayesian  estima¬ 
tor  in  nonidentifiable  nonlinear  regressions,”  In  Proceedings  of  the  2nd  Portuguese 
Conference  on  Automatic  Control,  Porto  1996,  pp.  257-261.  APCA,  September  1996. 

[21]  M.  Joannides  and  F.  Le  Gland,  “Nonhnear  filtering  with  continuous  time  perfect 
observations  and  noninformative  quadratic  variation,”  In  Proceedings  of  the  36th 
Conference  on  Decision  and  Control,  San  Diego,  1997,  pp.  1645-1650.  IEEE— CSS, 
December  1997. 

[22]  M.  Joannides  and  F.  Le  Gland,  “Small  noise  asymptotics  of  nonhnear  filters  with 
nonobservable  hmiting  deterministic  system,”  In  Proceedings  of  the  36th  Conference 
on  Decision  and  Control,  San  Diego,  1997,  pp.  1663-1668.  lEEE-CSS,  December 

1997. 


ARO  Grant  DAAH04-95-1-0164  (CAMS,  USC) 


Nonlinear  Filtering  28 


[23]  M.  Joaimides,  Navigation  Integree  d’un  Engin  Sous-Marin  Remarque.  Filtrage  Non- 
Lineaire  des  Systemes  avec  Contraintes  et/ou  Mesures  Parfaites.  Th^e  de  Doctorat, 
Universite  de  Provence,  Marseille,  March  1997. 

ftp : //ftp . inria . f r/INRIA/publicat ion/Theses/TU-0464 . ps . gz. 

[24]  G.  Jona-Lasinio  and  P.K.  Mitter,  “  Large  deviation  estimates  in  the  stochastic  quan¬ 
tization  of  $2)”  Commun.  Math.  Phys.,  vol.  130,  pp.  111-121,  1990. 

[25]  M.G.  Kendall  and  J.D.  Gibbons,  Rank  Correlation  Methods,  London,  1990. 

[26]  S.  Khgys  and  B.L.  Rozovskii,  “State  estimation  in  hidden  Markov  models  with  dis¬ 
tributed  observation”  (submitted). 

[27]  H.  Kimita,  “On  the  decomposition  of  solutions  of  stochastic  differential  equations,” 
In  D.  Williams,  editor.  Stochastic  Integrals,  Dmham  1980,  volume  851  of  Lecture 
Notes  in  Mathematics,  pp.  213-255.  Springer  Verlag,  Berlin,  1981. 

[28]  H.J.  Kushner,  “On  the  dynamical  equations  of  conditional  probabihty  density  func¬ 
tions,  with  applications  to  optimal  stochastic  control  theory,”  J.  Math.  Anal.  Appl, 
vol.  8,  pp.  332-344,  1964. 

[29]  H.J.  Kushner,  “On  the  differential  equations  satisfied  by  conditional  probabihty  den¬ 
sities  of  Markov  processes,”  SIAM  J.  Control,  vol.  2,  pp.  106-119,  1964. 

[30]  H.J.  Kushner,  Probability  Methods  for  Approximations  in  Stochastic  Control  and  for 
Elliptic  Equations,  vol.  129  of  Mathematics  in  Science  and  Engineering.  Academic 
Press,  New  York,  1977. 

[31]  H.J.  Kushner,  “Approximations  to  optimal  nonfinear  filters,”  IEEE  Transactions  on 
Automatic  Control,  vol.  AC-12,  pp.  546-556,  1967. 

[32]  Yu.  A.  Kutoyants,  Identification  of  Dynamical  Systems  with  Small  Noise,  volmne  300 
of  Mathematics  and  its  Applications.  Kluwer,  Dordrecht,  1994. 

[33]  H.  Kwakernaak  and  R.  Sivan,  Linear  Optimal  Control  Systems.  John  Wiley  &  Sons, 
New  York,  1972. 

[34]  F.  Le  Gland  and  L.  Mevel,  “Asymptotic  behaviour  of  the  MLE  in  hidden  Markov 
models,”  In  Proceedings  of  the  4th  European  Control  Conference,  Bruxelles  1997, 
July  1997.  Paper  FRA-F6. 

[35]  F.  Le  Gland  and  L.  Mevel,  “Exponential  forgetting  and  geometric  ergodicity  in  HM- 
M’s,”  In  Proceedings  of  the  36th  Conference  on  Decision  and  Control,  San  Diego 
1997,  pp.  537-542.  lEEE-CSS,  December  1997. 

[36]  F.  Le  Gland  and  L.  Mevel,  “Recursive  identification  in  HMM’s,”  In  Proceedings  of 
the  36th  Conference  on  Decision  and  Control,  San  Diego  1997,  pp.  3468-3473.  lEEE- 
CSS,  December  1997. 


ARO  Grant  DAAH04-95-1-0164  (CAMS,  USC) 


Nonlinear  Filtering  29 


[37]  S.L.  Leonov,  “Nonparametric  methods  for  clutter  removal”,  University  of  Southern 
California,  Center  for  Applied  Mathematical  Sciences,  1998,  Technical  Report  (in 
preparation). 

[38]  J.  Levine  and  R.  Marino,  “Constant-speed  target  tracking  via  bearings-only  mea¬ 
surements,”  IEEE  Transactions  on  Aerospace  and  Electronic  Systems,  vol.  AES-28, 
pp.  174-182,  January  1992. 

[39]  R.Sh.  Liptser  and  A.N.  Shiryayev,  The  Statistics  of  Random  Process,  I, II.  Springer- 
Verlag,  Berlin,  1977,  1978. 

[40]  S.V.  Lototsky  and  B.L.  Rozovskii,  “Recursive  multiple  Wiener  integral  expansion  for 
nonhnear  filtering  of  diffusion  processes.”  In:  Stochastic  Processes  and  Functional 
Analysis  (J.A.  Goldstein  et  al..  Editors),  Marcel  Dekker,  New  York,  pp.  199-208, 
1996. 

[41]  S.V.  Lototsky  and  B.L.  Rozovskii,  “Recursive  nonlinear  filter  for  a  continuous- 
discrete  time  model,”  IEEE  Trans.  Automatic  Cont,  (to  appear). 

[42]  S.V.  Lototsky,  C.  Rao,  and  B.L.  Rozovskii,  “Fast  nonlinear  filter  for  continuous- 
discrete  time  multiplication  models:  separation  of  parameters  and  observations,” 
Proceedings  of  the  35th  Conference  on  Decision  and  Control,  Kobe,  Japan,  1996,  vol. 
4,  Omnipress,  Madison,  Wisconsin,  pp.  4060-4064. 

[43]  S.V.  Lototsky,  R.  Mikulevicius,  and  B.L.  Rozovskii,  “Nonlinear  filtering  revisited:  a 
spectral  approach,”  Control  and  Optimization,  Vol.  35,  No.  2,  March  1997. 

[44]  S.V.  Lototsky,  C.  Rao,  and  B.L.  Rozovskii,  “Method  of  optimal  stochastic  filtering 
for  tracking  objects  with  possibly  nonlinear  dynamics,”  U.S.  Patent  (application). 

[45]  S.V.  Lototsky,  Problems  in  Statistics  of  Stochastic  Differential  Equations,  Ph.D.  The¬ 
sis,  University  of  Southern  California,  1996. 

[46]  S.V.  Lototsky  and  B.L.  Rozovsky,  “Parameter  estimation  for  stochastic  evolution 
equations  with  non-commuting  operators” ,  IMA  Preprint  Series  #1501,  August  1997. 

[47]  L.  Mevel,  Statistique  Asymptotique  pour  les  ModNes  de  Markov  Caches.  These  de 
Doctoral,  Universite  de  Rennes  1,  Reimes,  November  1997. 

ftp : //ftp . irisa . fr/techreport s/theses/1997 /mevel . ps . gz. 

[48]  R.  Mikulevicius  and  B.L.  Rozovskii,  “Martingale  problems  for  stochastic  PDE’s,” 
In  Stochastic  PDE’s:  Six  Perspectives,  R.  Carmona,  B.L.  Rozovskii,  Editors.  AMS, 
1998  (to  appear). 

[49]  R.  Mikulevicius  and  B.L.  Rozovskii,  “Parabolic  stochastic  PDE’s  and  Wiener  chaos,” 
SIAM  J.  Math.  Anal,  (submitted). 


ARO  Grant  DAAH04-95-1-0164  (CAMS,  USC) 


Nonlinear  Filtering  30 


0 


[50]  R.  Mikulevicius  and  B.L.  Rozovskii,  “Normalized  stochastic  integrals  in  topological 
vector  spaces,”  Seminaire  de  Probabilities  (to  appear). 

[51]  R.  Mikulevicius  and  B.L.  Rozovskii,  “Fourier-Hermit  expansion  for  nonlinear  filter¬ 
ing,”  In:  Liber  Amicorum  for  A.N.  Shiry ay ev  {to  appear). 

[52]  R.  Mikulevicius  and  B.L.  Rozovskii,  “  Soft  solutions  of  stochastic  PDF’s  and  Wiener 
Chaos,”  In  Stochastic  Analysis  on  Infinite  Dimensional  Spaces  (H.  Kunita  and  H.-H. 
Kuo,  Editors),  Pitman  Research  Notes  in  Mathematics  Series  310,  Longman,  1994. 

[53]  R.E.  Mortensen,  Optimal  Control  of  Continuous  Time  Stochastic  Systems,  Ph.D. 
Thesis,  University  of  California,  Berkeley,  1966. 

[54]  D.  Nualart  and  B.L.  Rozovskii,  “Weighted  stochastic  Sobolev  spaces  and  bilinear 
SPDE’s  driven  by  space-time  white  noise,”  J.  Funct.  Anal,  (submitted) . 

[55]  D.  Ocone,  “Multiple  integral  expansions  for  nonlinear  filtering,”  Stochastics,  vol.  10, 
pp.  1-30,  1983. 

[56]  G.  Parisi  and  Y.S.  Wu,  “Perturbation  theory  without  gauge  fixing,”  Sienta  Sinica, 
vol.  24,  pp.  383-496,  1981. 

[57]  J.  Picard,  “Efficiency  of  the  extended  Kalman  filter  for  nonlinear  systems  with  small 
noise,”  SIAM  Journal  on  Applied  Mathematics,  vol.  51,  pp.  843-885,  June  1991. 

[58]  I.S.  Reed,  R.M.  Gagliardi,  and  H.M.  Shao,  “  Application  of  three-dimensional  fil¬ 
tering  to  moving  target  detection,”  IEEE  Trans.  AES,  vol.  AES-19,  pp.  898-904, 
1983. 

[59]  B.L.  Rozovskii,  “A  simple  proof  of  uniqueness  for  Kushner  and  Zakai  equations,”  In 
Stochastic  Analysis,  (E.  Mayer-Wolf,  et  al..  Editors),  Academic  Press,  Boston,  pp. 
449-458,  1991. 

[60]  T.  Ryden,  “Consistent  and  asymptotically  normal  parameter  estimates  for  hidden 
Markov  models,”  The  Annals  of  Statistics,  vol.  22,  no.  4,  pp.  1884-1895,  December 
1994. 

[61]  T.  Ryden,  “On  recursive  estimation  for  hidden  Markov  models,”  Stochastic  Processes 
and  their  Applications,  vol.  66,  no.  1,  pp.  79-96,  1997. 

[62]  N.  Wiener,  “The  homogeneous  chaos,”  Am.  J.  Math.,  vol.  60,  pp.  897-936,  1930. 

[63]  N.  Wiener,  Extrapolation,  Interpolation  and  Smoothing  of  Stationary  Time  Series. 
Wiley,  New  York,  1949. 

[64]  M.  Zakai,  “On  the  optimal  filtering  of  diffusion  processes,”  Zeitschrift  fiir 
Wahrscheinlichkeitstheorie  und  verwandte  Gebiete,  vol.  11,  no.  3,  ,pp.  230-243,  1969. 


